When hosting a party or event, having uninterrupted music with smooth transitions is crucial. The Spotify Automix fuctionality can be a good solution, although not all playlists are suitable for this feature. Spotify offers pre-programmed playlists that include the Automix functionality, but users must configure their settings correctly to use it. To learn how to set up Automix functionality properly, refer to this link.
Despite its benefits, Spotify’s Automix functionality has some limitations compared to a regular DJ. Firstly, it lacks the complete experience of a DJ. Secondly, users cannot easily add songs to pre-programmed playlists. Lastly, there has been limited research on the techniques used by the Automix functionality in music studies. This raises the question of whether the techniques used by the Automix functionality are similar to those used by regular DJs or not.
This portfolio investigates the relationship between Spotify’s Automix functionality and the mix components used in the DJ industry for smooth song transitions.The Automix playlists used for this portfolio are HouseWerk, TechnoBunker, Mint and Dance Rising. The music contained in the playlist is best classified under House music. In subcategory the Housewerk playlist contains techhouse, TechnoBunker techno, DanceRising and Mint dance pop. The interest in this comparison came from being a DJ myself. In the DJ world, mainly 2 components are used that are important for a transition between two songs. These are the tempo of the songs and the key that the mixing songs are in. It will be investigated whether these components are also used by the automix functionality. This will focus on the order of the songs in the playlist. The shuffle functionality will be left out. It may also turn out that the functionality is completely random.
The first visualization shows the distribution over tempo of the
songs within the different playlists. In the visualization below, the x
axis shows the different playlists. On the y axis, the tempo is visible.
If we look closely at the visualization we see that the playlist
Technobunker has a high median compared to the other playlists. There
are also the fewest outliers here compared to the other playlists. The
spread of Housewerk is the smallest. This means that the tempo in this
playlist is mostly close to the median. This is a positive first
analysis on the research question.
The outliers negatively affect
overmixing. At least on a basic level of mixing. From the DJ industry,
songs with a maximum tempo difference of 6 bpm can be mixed. A so-called
Master Tempo function on the turntable ensures that the key does not
change at that moment. See a helpful explainer video here. When
the tempo difference is greater than 6 bpm, the listener will clearly
hear the difference in tempo. A possible solution to compensate for the
large bpm difference is to use tempo harmonics and subharmonics. For
example, a 120 BPM song can be mixed with the 60 BPM subharmonic. In the
following visualization it is examined whether this is used.
The second visualization shows the progression of tempo over the
playlist. On the x axis here the songs are numerically represented in
playlist order. On the y axis, the tempo is shown. The figures are
analyzed by playlist. The second visualization complements the first
visualization well. Both figures clearly show that Dance Rising and mint
have the most outliers. Looking at the order, the technique of
subharmonics (as discussed on the previous page) is probably used once.
In the dance rising playlist this can be seen in around track 45 and 46.
The tempo difference between track 45 (80 bpm) and track 46 (160 bpm).
Unfortunately, in cases of the large tempo difference, this technique is
not used more often. We can therefore already draw a first conclusion
that in these cases the transitions in terms of tempo are random.
The best build in tempo can be seen in the technobunker playlist. In
this visualization, the pace is clearly building and decreasing. There
are few peaks in the visualization, which means that techniques such as
subharmonics are not applicable. This is positive for the
transitions
In the DJ world, the easiest method to mix two songs is to keep the tempo the same throughout the song. When this is not the case, it requires an advanced technique to mix the songs together. The tempo analysis features in the turntable are now no longer accurate because this currently gives an average value of the tempo over time rather than the tempo per section in the song. The DJ must be familiar with this change in tempo in order to mix the song correctly. An example of an outlier in the corpus is the song Affraid to Feel by LF SYSTEM. The song was released on May 2, 2022. The song uses the sample from the song I Can’t Stop (Turning You On) by SILK. This song has a tempo of 85 bpm. So for LF SYSTEM it was a challenge to incorporate the sample into a disco track with a tempo of 130 BPM. This is evident from the tempogram. The song starts slowly with a tempo of 100 BPM. After 20 seconds, the tempo increases to 130 BPM. This is when song switches from the intro to the chorus. What is not immediately clear is that after the chorus, the tempo drops slightly. After repeated listens, this became audible. From the tempogram it can be seen that after the first chorus, the chorus returns 2 more times. The last thing that stands out is that after about 150 seconds the tempo is classified as rapidly increasing. In the recording it can be heard that the kick drum changes from every four-quarter measure to every 8th measure. This makes the tempo seem to double but is not actually the case. It is the lead-up to the chorus.
In this section, we chose to compare two songs from the playlist Housewerk. The songs chosen are listed in order in the playlist. This is to see if it becomes clear if certain chords are used in the songs that go well together. This makes the transition between the songs sound better. Starting with the first track, the song No Diggity produced by Häwk and Beyge was chosen. The song is a cover of of the original song written by Blackstreet, Dr. Dre and Queen Pan that was released in 1996. The song is played primarily in C# and F#. Chebyshev was chosen as the normalization. This is because with the euclidean and manhattan it was not clearly visible which notes were used.The second song analyzed is the song Ritmo by Raffa FL. It is a cover of the song La Colegiala by The Boy Next Door, Fresh Coast and Jody Bernal. From the chromogram, we notice that the song is played in F, G and Gm. Euclidean was chosen as the normalization. This is because with this method the magnitude (in yellow) was the clearest. With confirmation from the spotify API, it is clear that these two songs can be well mixed together.
In this section, we chose to compare two songs from the playlist Housewerk. The songs chosen are listed in order in the playlist. This is to see if it becomes clear if certain chords are used in the songs that go well together. This makes the transition between the songs sound better. Starting with the first track, the song No Diggity produced by Häwk and Beyge was chosen. The song is a cover of of the original song written by Blackstreet, Dr. Dre and Queen Pan that was released in 1996. The song is played primarily in C# and F#. Chebyshev was chosen as the normalization. This is because with the euclidean and manhattan it was not clearly visible which notes were used.The second song analyzed is the song Ritmo by Raffa FL. It is a cover of the song La Colegiala by The Boy Next Door, Fresh Coast and Jody Bernal. From the chromogram, we notice that the song is played in F, G and Gm. Euclidean was chosen as the normalization. This is because with this method the magnitude (in yellow) was the clearest. With confirmation from the spotify API, it is clear that these two songs can be well mixed together.
Above are the similairty matrices of the chroma features (left) and timbre (right). The song chosen is La Fuente - I Want You. The song is in key 8, the tempo is 125 bpm and energy is 0.8. We analyze the figure using 3 criteria. Representation (shows the repeating elements with diagonal sloping lines), novelty (bright yellow lines when a part is not similar and dark lines when it is) and lastly homogeneity (A so-called chessboard that when it is dark blue shows that everything is similar and when it is more yellow more difference). Starting with the representation, it is difficult to find the diagonal lines. They can be seen very slightly at the end of the song Looking at the novelty, it is clear that the song has 2 refrains. These are the intersections of the yellow lines around 40 and 190 seconds. At that point the song changes tremendously. Looking last at the homogeneity, the chessboards are mainly found between 50 and 100 seconds.
Looking at the second song, we look at the analysis of the song Tom Santa - Rainfall. This is a cover of the song Shackles released by Mary Mary. The song is in key 5, the tempo is 128 bpm and the energy is almost 0.9. Like the song by La Fuente, I analyze this song using the 3 characteristics of representation, novelty and homogeneity. Starting with representation, we see the diagonal dark lines a lot in the song. During the beginning of the song until 30 seconds it is clearly visible. Looking at the novelty, a large intersection can be found around 90 seconds. You can clearly hear the transition in the song. Lastly looking at the homogeneity, the chess boards are also well found in this song. Especially some smaller chessboards between 25 and 90 seconds. Because these are dark blue chessboards, the song changes a lot in this time interval
Using random forest, an attempt was made to obtain the most important features from the playlists. Due to the size of the playlists, the decision was made to focus only on the first 20 songs of each playlist. In order to get an overall view, the choice was made to use as many features as possible. The visualization shows that tempo is most important. This meets the expectations. What is interesting is that the 6th coefficient of the timbre vector is almost as important as tempo. It is difficult to determine what the 6th dimension of the timbre vector means. The documentation on this vector is quite limited. The rest of the features decrease gradually.
To improve the number of good transitions, I want to show a possible solution using clustering to better determine the order of the playlist. The best features from previous page (tempo and c6) were used to create this clustering. Single linkage was chosen where the smallest distance between clusters is calculated. The choice of this form has two reasons. First, outliers can then be discovered first. These songs can then potentially be used at the beginning of the playlist because the user starts the song at that point. Second, clusters are now created where the distance between songs is smallest. An order of songs in the playlist can now be easily created.
The conclusion summarizes the results and answers the research
question posed in the introduction.
The introduction introduced the
research question: “Does the spotify automix functionality use the same
techniques as a regular DJ?”. In this portfolio, visualizations were
used to try to answer this question.
Starting with the
visualizations on tempo. From the visualizations, it can be seen that
the Dance Rising and mint playlist contain the most outliers in terms of
tempo. This negatively effects the overmixing of the songs. Housework
and Technobunker are the same in terms of tempo. The songs are in the
same tempo range and few outliers can be found there. In addition, there
is a single song that makes the transition tremendously difficult as
seen in the last visualization in terms of tempo. In the song Affraid to
feel by LF SYTEMS, the tempo chart showed that the song consists of
different tempos that does not make the transition to another song
easier.
In the visuals about key, the first insight was generated
into how many songs were properly over-mixed based on key. This number
was low for all four playlists. Using one example, a good transition was
shown and using another example, a bad transition was shown. Using the
circle of fifth’s, the difference between a good and a bad transition
became easily visible.
From the energy and valence figure, it could
be seen that the playlists are very energetic and the valence is very
scattered. From the loudness, few outliers could be found. The biggest
finding found in this visualization is that the technobunker playlist
consists primarily of songs in major.
The Chromograms showed that
by eye it is very difficult to tell which chords were used in songs.
Therefore, the least data was obtained from this to answer the research
question.
The similarity matrices were interesting. It was clear
from the first figure that La Fuente’s song I Want You contained
multiple choruses but representation was difficult to see. In contrast,
the second visualization showed more insight on novelty. The
visualizations showed good insight into repetitions and changes in the
song. The visualizations did not have a major impact on answering the
research question but showed good insights.
The last visualizations
in this portfolio regarding the clustering summarize the experiences
made in the portfolio. The portfolio showed that the most information
regarding the research question could be obtained from the tempo and key
components. As mentioned earlier in this conclusion, the transitions
made by Spotify’s Automix functionality are not yet optimal. By means of
clustering a method is shown how feature selection and tree’s can be
used to adjust the order within the playlist in order to improve the
number of transitions.
To summarize, we can answer the research question. The tempo and key features show that Spotify makes little use of these features to overmix the songs. A limited number of songs are overmixed well, but that could also be a coincidence. Track popularity will probably have a heavier consideration in the order in which spotify delivers the playlist.In addition, it has also been found that features such as mode, loudness, energy and valence can tell little about the overmix technique. However, these features have provided more insight into the data. From the clustering visualizations, spotify can make a consideration to improve the transitions and enhance the listener’s listening experience.
There are a number of discussion points that need to be discussed.
First, the playlist changed over time. Since these were not their own playlists, Spotify changed the playlist every week. As a result, the research result has changed slightly several times. However, the changes are not worth mentioning because most songs remained in the playlist and a single new song was added.
Secondly, Spotify is all about the popularity of a playlist and less about the automix functionality. As a result, the quality of the transitions will weigh less heavily. Spotify may be able to expand playlists where the transition is more important.
Finally, this research was done on only 4 Spotify playlists. For a higher reliability and a more diverse answer, this research could be repeated on the other playlists that contain this functionality.